Disclosed is a process for forming a normalized genomic DNA library from an environmental sample by (a) isolating a genomic DNA 
population from the environmental sample; (b) at least one of (i) amplifying the copy number of the DNA population so isolated and (ii) 
recovering a fraction of the isolated genomic DNA having a desired characteristic; and (c) normalizing the representation of various DNAs 
within the genomic DNA population so as to form a normalized library of genomic DNA from the environmental sample. Also disclosed 
is a normalized genomic DNA library formed from an environmental sample by the process. 
PRODUCTION AND USE OF NORMALIZED DNA LIBRARIES

FIELD OF THE INVENTION

The present invention relates to the field of production and screening of
gene libraries, and more particularly to the generation and screening of normalized
genomic DNA libraries from mixed populations of microbes and/or other organisms.

BACKGROUND OF THE INVENTION 

There has been increasing demand in the research reagent, diagnostic
reagent and chemical process industries for protein-based catalysts possessing novel
capabilities. At present, this need is largely addressed using enzymes purified from
a variety of cultivated bacteria or fungi. However, because less than 1% of naturally
occurring microbes can be grown in pure culture (Amann, 1995), alternative
techniques must be developed to exploit the full breadth of microbial diversity for
potentially valuable new products.

Virtually all of the commercial enzymes now in use have come from
cultured organisms. Most of these organisms are bacteria or fungi. Amann et al.
(Amann, 1995) have estimated the culturability of microorganisms in the environment as
follows:
Habitat                        Culturability (%)

Seawater                       0.001-0.1
Freshwater                     0.25
Mesotrophic lake               0.01-1.0
Unpolluted estuarine waters    0.1-3.0
Activated sludge               1.0-15.0
Sediments                      0.25
Soil                           0.3



These data were determined from published information regarding the number
of cultivated microorganisms derived from the various habitats indicated.

Other studies have also demonstrated that cultivated organisms comprise only
a small fraction of the biomass present in the environment. For example, one group
of workers recently reported the collection of water and sediment samples from the
"Obsidian Pool" in Yellowstone National Park (Barns, 1994), where they found cells
hybridizing to archaea-specific probes in 55% of 75 enrichment cultures.
Amplification and cloning of 16S rRNA-encoding sequences revealed mostly unique
sequences with little or no representation of the organisms which had previously been
cultured from this pool, suggesting the existence of substantial diversity of archaea
with so far unknown morphological, physiological and biochemical features. Another
group performed similar studies on the cyanobacterial mat of Octopus Spring in
Yellowstone Park and came to the same conclusion, namely, that tremendous uncultured
diversity exists (Ward, 1990). Giovannoni et al. (1990) and Torsvik et al. (1990a)
have reported similar results using bacterioplankton collected in the Sargasso Sea and
soil samples, respectively. These results indicate that the exclusive use of cultured
organisms in screening for useful enzymatic or other bioactivities severely limits the
sampling of the potential diversity in existence.



WO 99/45154 PCT/US99/04917

Screening of gene libraries from cultured samples has already proven valuable.
It has recently been made clear, however, that the use of only cultured organisms for
library generation limits access to the diversity of nature. The uncultivated organisms
present in the environment, and/or enzymes or other bioactivities derived therefrom, may
be useful in industrial processes. The cultivation of each organism represented in any
given environmental sample would require significant time and effort. It has been
estimated that in a rich sample of soil, more than 10,000 different species can be
present. It is apparent that attempting to individually cultivate each of these species
would be a cumbersome task. Therefore, novel methods of efficiently accessing the
diversity present in the environment are highly desirable.

SUMMARY OF THE INVENTION 

The present invention addresses this need by providing methods to isolate the
DNA from a variety of sources, including isolated organisms, consortia of
microorganisms, primary enrichments, and environmental samples, to make libraries
which have been "normalized" in their representation of the genome populations in the
original samples, and to screen these libraries for enzyme and other bioactivities.

The present invention represents a novel, recombinant approach to generate and
screen DNA libraries constructed from mixed microbial populations of cultivated or,
preferably, uncultivated (or "environmental") samples. In accordance with the present
invention, libraries with equivalent representation of genomes from microbes that can
differ vastly in abundance in natural populations are generated and screened. This
"normalization" approach reduces the redundancy of clones from abundant species and
increases the representation of clones from rare species. These normalized libraries
allow for greater screening efficiency, resulting in the isolation of genes encoding novel
biological catalysts.




Screening of mixed populations of organisms has been made a rational
approach because of the availability of the techniques described herein, whereas previous
attempts at screening of mixed populations were not feasible and were avoided because
of the cumbersome procedures required.

Thus, in one aspect the invention provides a process for forming a normalized
genomic DNA library from an environmental sample by (a) isolating a genomic DNA
population from the environmental sample; (b) at least one of (i) amplifying the copy
number of the DNA population so isolated and (ii) recovering a fraction of the isolated
genomic DNA having a desired characteristic; and (c) normalizing the representation
of various DNAs within the genomic DNA population so as to form a normalized
library of genomic DNA from the environmental sample.

In one preferred embodiment of this aspect, the process comprises the step of
recovering a fraction of the isolated genomic DNA having a desired characteristic.

In another preferred embodiment of this aspect, the process comprises the step
of amplifying the copy number of the DNA population so isolated.

In another preferred embodiment of this aspect, the step of amplifying the
genomic DNA precedes the normalizing step. In an alternate preferred embodiment
of this aspect, the step of normalizing the genomic DNA precedes the amplifying step.

In another preferred embodiment of this aspect, the process comprises both the
steps of (i) amplifying the copy number of the DNA population so isolated and (ii)
recovering a fraction of the isolated genomic DNA having a desired characteristic.

Another aspect of the invention provides a normalized genomic DNA library
formed from an environmental sample by a process comprising the steps of (a)
isolating a genomic DNA population from the environmental sample; (b) at least one
of (i) amplifying the copy number of the DNA population so isolated and (ii)
recovering a fraction of the isolated genomic DNA having a desired characteristic; and
(c) normalizing the representation of various DNAs within the genomic DNA
population so as to form a normalized library of genomic DNA from the
environmental sample. The various preferred embodiments described with respect to
the above method aspect of the invention are likewise applicable with regard to this
aspect of the invention.

The invention also provides a process for forming a normalized genomic DNA
library from an environmental sample by (a) isolating a genomic DNA population
from the environmental sample; (b) at least one of (i) amplifying the copy number of
the DNA population so isolated and (ii) recovering a fraction of the isolated genomic
DNA having a desired characteristic; and (c) normalizing the representation of various
DNAs within the genomic DNA population so as to form a normalized library of
genomic DNA from the environmental sample.

Another aspect of the invention provides a normalized genomic DNA library
formed from an environmental sample by a process comprising the steps of (a)
isolating a genomic DNA population from the environmental sample; (b) at least one
of (i) amplifying the copy number of the DNA population so isolated and (ii)
recovering a fraction of the isolated genomic DNA having a desired characteristic; and
(c) normalizing the representation of various DNAs within the genomic DNA
population so as to form a normalized library of genomic DNA from the
environmental sample. The various preferred embodiments described with respect to
the above method aspect of the invention are likewise applicable with regard to this
aspect of the invention.






BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 is a graph showing the percent of total DNA content represented by 
G + C in the various genomic DNA isolates tested as described in Example 2. 
DETAILED DESCRIPTION OF THE INVENTION
DNA ISOLATION: 

An important step in the generation of a normalized DNA library from an
environmental sample is the preparation of nucleic acid from the sample. DNA can
be isolated from samples using various techniques well known in the art (Nucleic
Acids in the Environment: Methods & Applications, J.T. Trevors, D.D. van Elsas,
Springer Laboratory, 1995). Preferably, the DNA obtained will be of large size and free
of enzyme inhibitors and other contaminants. DNA can be isolated directly from the
environmental sample (direct lysis), or cells may be harvested from the sample prior
to DNA recovery (cell separation). Direct lysis procedures have several advantages
over protocols based on cell separation. The direct lysis technique provides more
DNA with a generally higher representation of the microbial community; however, the
DNA is sometimes smaller in size and more likely to contain enzyme inhibitors than DNA
recovered using the cell separation technique. Very useful direct lysis techniques have
recently been described which provide DNA of high molecular weight and high purity
(Barns, 1994; Holben, 1994). If inhibitors are present, there are several protocols
which utilize cell isolation which can be employed (Holben, 1994). Additionally, a
fractionation technique, such as the bis-benzimide separation (cesium chloride
isolation) described below, can be used to enhance the purity of the DNA.

FRACTIONATION:

Fractionation of the DNA samples prior to normalization increases the chances
of cloning DNA from minor species from the pool of organisms sampled. In the
present invention, DNA is preferably fractionated using a density centrifugation
technique. One example of such a technique is a cesium-chloride gradient.
Preferably, the technique is performed in the presence of a nucleic acid intercalating
agent which will bind regions of the DNA and cause a change in the buoyant density
of the nucleic acid. More preferably, the nucleic acid intercalating agent is a dye,
such as bis-benzimide, which will preferentially bind regions of DNA (AT in the case
of bis-benzimide) (Mullen, 1975; Manuelidis, 1977). When nucleic acid complexed
with an intercalating agent, such as bis-benzimide, is separated in an appropriate
cesium-chloride gradient, the nucleic acid is fractionated. If the intercalating agent
preferentially binds regions of the DNA, such as GC or AT regions, the nucleic acid
is separated based on relative base content in the DNA. Nucleic acid from multiple
organisms can be separated in this manner.
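
Because the separation described above depends on relative base content, it can help to estimate the G+C fraction of each genome expected in a sample. The following is a minimal sketch only: the function names and the toy sequences are hypothetical, and the code does not model buoyant density or dye binding.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def rank_by_gc(genomes: dict[str, str]) -> list[tuple[str, float]]:
    """Sort genome names from lowest to highest G+C fraction."""
    scored = ((name, gc_fraction(seq)) for name, seq in genomes.items())
    return sorted(scored, key=lambda item: item[1])


# Toy sequences standing in for real genomes (illustrative only).
toy_genomes = {
    "mid_gc": "ATGCATGCGC",
    "high_gc": "GCGCGCGCAT",
    "low_gc": "ATATATGCAT",
}

for name, gc in rank_by_gc(toy_genomes):
    print(f"{name}: {gc:.0%} G+C")
```

Genomes that are far apart in this ranking would be expected to band at different positions in the gradient, while genomes with similar G+C content may co-migrate.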

Density gradients are currently employed to fractionate nucleic acids. For
example, the use of bis-benzimide density gradients for the separation of microbial
nucleic acids for use in soil typing and bioremediation has been described. In these
experiments, one evaluates the relative abundance of A260 peaks within fixed benzimide
gradients before and after remediation treatment to see how the bacterial populations
have been affected. The technique relies on the premise that, on average, the GC
content of a species is relatively consistent. This technique is applied in the present
invention to fractionate complex mixtures of genomes. The nucleic acids derived from
a sample are subjected to ultracentrifugation and fractionated while measuring the A260
as in the published procedures.

In one aspect of the present invention, equal A260 units are removed from each
peak, the nucleic acid is amplified using a variety of amplification protocols known
in the art, including those described hereafter, and gene libraries are prepared.
Alternatively, equal A260 units are removed from each peak, and gene libraries are
prepared directly from this nucleic acid. Thus, gene libraries are prepared from a
combination of equal amounts of DNA from each peak. This strategy enables access
to genes from minority organisms within environmental samples and enrichments,
whose genomes may not be represented, or may even be lost, if a library is constructed
from the total unfractionated DNA sample, because the organisms are present in such
minor quantity. Alternatively, DNA can be normalized subsequent to
fractionation, using techniques described hereafter. DNA libraries can then be
generated from this fractionated/normalized DNA.
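
The pooling of equal A260 units from each peak can be sketched as a simple volume calculation. This is an illustrative sketch, assuming each reading is the A260 of the collected peak fraction at a 1 cm path length, so that one A260 unit is contained in 1/A260 ml. The function name and the readings are hypothetical.

```python
def pooling_volumes_ml(peak_a260: dict[str, float],
                       units_per_peak: float) -> dict[str, float]:
    """Volume (ml) to take from each peak so every peak contributes
    the same number of A260 units (assumes 1 cm path length)."""
    volumes = {}
    for name, a260 in peak_a260.items():
        if a260 <= 0:
            raise ValueError(f"peak {name!r} has no measurable absorbance")
        volumes[name] = units_per_peak / a260
    return volumes


# Hypothetical A260 readings for three gradient peaks.
readings = {"peak_1": 2.0, "peak_2": 0.5, "peak_3": 0.1}
for name, vol in pooling_volumes_ml(readings, units_per_peak=0.05).items():
    print(f"{name}: take {vol:.3f} ml")
```

A weak peak requires a proportionally larger volume, which is how DNA from a minor organism ends up equally represented in the pooled material.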
The composition of multiple fractions of the fractionated nucleic acid can be
determined using PCR-related amplification methods of classification well known in
the art.

NORMALIZATION: 

Previous normalization protocols have been designed for constructing
normalized cDNA libraries (WO 95/08647, WO 95/11986). These protocols were
originally developed for the cloning and isolation of rare cDNAs derived from
mRNA. The present invention relates to the generation of normalized genomic DNA
gene libraries from uncultured or environmental samples.

Nucleic acid samples isolated directly from environmental samples or from
primary enrichment cultures will typically contain genomes from a large number of
microorganisms. These complex communities of organisms can be described by the
absolute number of species present within a population and by the relative abundance
of each organism within the sample. Total normalization of each organism within
a sample is very difficult to achieve. Separation techniques such as optical tweezers
can be used to pick morphologically distinct members within a sample. Cells from each
member can then be combined in equal numbers, or pure cultures of each member
within a sample can be prepared and equal numbers of cells from each pure culture
combined to achieve normalization. In practice, this is very difficult to perform,
especially in a high-throughput manner.

The present invention involves the use of techniques to approach normalization
of the genomes present within an environmental sample, generating a DNA library
from the normalized nucleic acid, and screening the library for an activity of interest.

In one aspect of the present invention, DNA is isolated from the sample and
fractionated. The strands of nucleic acid are then melted and allowed to selectively
reanneal under fixed conditions (C0t-driven hybridization). Alternatively, DNA is not
fractionated prior to this melting process. When a mixture of nucleic acid fragments
is melted and allowed to reanneal under stringent conditions, the common sequences
find their complementary strands faster than the rare sequences. After an optional
single-stranded nucleic acid isolation step, single-stranded nucleic acid, representing
an enrichment of rare sequences, is amplified and used to generate gene libraries. This
procedure leads to the amplification of rare or low abundance nucleic acid molecules.
These molecules are then used to generate a library. While all DNA will be
recovered, the identification of the organism originally containing the DNA may be
lost. This method offers the ability to recover DNA from "unclonable sources."
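
The effect of reannealing on relative abundance can be illustrated with idealized second-order kinetics, in which a sequence at initial concentration C0 remains single-stranded at C0 / (1 + k·C0·t). This is a sketch under stated assumptions: a single rate constant k for all sequences, no cross-hybridization, and arbitrary units. The function names and the abundance values are hypothetical and are not taken from the specification.

```python
def single_stranded_remaining(c0: float, k: float, t: float) -> float:
    """Concentration still single-stranded after time t under ideal
    second-order reassociation: C(t) = C0 / (1 + k * C0 * t)."""
    return c0 / (1.0 + k * c0 * t)


def normalized_fractions(abundances: dict[str, float],
                         k: float, t: float) -> dict[str, float]:
    """Relative composition of the single-stranded fraction after reannealing."""
    remaining = {name: single_stranded_remaining(c0, k, t)
                 for name, c0 in abundances.items()}
    total = sum(remaining.values())
    return {name: value / total for name, value in remaining.items()}


# Hypothetical starting composition: one dominant genome, one rare genome.
start = {"abundant": 0.900, "common": 0.099, "rare": 0.001}
after = normalized_fractions(start, k=1.0, t=10_000.0)

for name in start:
    print(f"{name}: {start[name]:.3f} -> {after[name]:.3f}")
```

In this idealized model the abundant sequences are depleted from the single-stranded pool much faster than the rare ones, so the composition of the single-stranded fraction moves toward equal representation as the incubation proceeds.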

Nucleic acid samples derived using the previously described technique are
amplified to complete the normalization process. For example, samples can be
amplified using PCR amplification protocols such as those described by Ko et al. (Ko,
1990b; Ko, 1990a; Takahashi, 1994), or more preferably, long PCR protocols such as
those described by Barnes (1994) or Cheng (1994).

Normalization can be performed directly, or steps can also be taken to reduce
the complexity of the nucleic acid pools prior to the normalization process. Such
reduction in complexity can be beneficial in recovering nucleic acid from the poorly
represented organisms.

The microorganisms from which the libraries may be prepared include
prokaryotic microorganisms, such as Eubacteria and Archaebacteria, and lower
eukaryotic microorganisms such as fungi, some algae and protozoa. The
microorganisms may be cultured microorganisms or uncultured microorganisms
obtained from environmental samples, and such microorganisms may be extremophiles,
such as thermophiles, hyperthermophiles, psychrophiles, psychrotrophs, etc.

As indicated above, the library may be produced from environmental
samples, in which case DNA may be recovered without culturing of an organism, or
the DNA may be recovered from a cultured organism.

Sources of microorganism DNA as a starting material library from which
target DNA is obtained are particularly contemplated to include environmental
samples, such as microbial samples obtained from Arctic and Antarctic ice, water
or permafrost sources, materials of volcanic origin, materials from soil or plant
sources in tropical areas, etc. Thus, for example, genomic DNA may be recovered
from either a culturable or non-culturable organism and employed to produce an
appropriate recombinant expression library for subsequent determination of enzyme
activity.

Bacteria and many eukaryotes have a coordinated mechanism for regulating
genes whose products are involved in related processes. The genes are clustered, in
structures referred to as "gene clusters," on a single chromosome and are
transcribed together under the control of a single regulatory sequence, including a
single promoter which initiates transcription of the entire cluster. The gene cluster,
the promoter, and additional sequences that function in regulation altogether are
referred to as an "operon" and can include up to 20 or more genes, usually from 2
to 6 genes. Thus, a gene cluster is a group of adjacent genes that are either
identical or related, usually as to their function.
Some gene families consist of identical members. Clustering is a
prerequisite for maintaining identity between genes, although clustered genes are
not necessarily identical. Gene clusters range from extremes where a duplication is
generated to adjacent related genes to cases where hundreds of identical genes lie in
a tandem array. Sometimes no significance is discernible in a repetition of a
particular gene. A principal example of this is the expressed duplicate insulin
genes in some species, whereas a single insulin gene is adequate in other
mammalian species.

It is important to further research gene clusters and the extent to which the
full length of the cluster is necessary for the expression of the proteins resulting
therefrom. Further, gene clusters undergo continual reorganization and, thus, the
ability to create heterogeneous libraries of gene clusters from, for example,
bacterial or other prokaryote sources is valuable in determining sources of novel
proteins, particularly including enzymes such as, for example, the polyketide
synthases that are responsible for the synthesis of polyketides having a vast array of
useful activities. Other types of proteins that are the product(s) of gene clusters are
also contemplated, including, for example, antibiotics, antivirals, antitumor agents
and regulatory proteins, such as insulin.

Polyketides are molecules which are an extremely rich source of
bioactivities, including antibiotics (such as tetracyclines and erythromycin),
anti-cancer agents (daunomycin), immunosuppressants (FK506 and rapamycin), and
veterinary products (monensin). Many polyketides (produced by polyketide
synthases) are valuable as therapeutic agents. Polyketide synthases are
multifunctional enzymes that catalyze the biosynthesis of a huge variety of carbon
chains differing in length and patterns of functionality and cyclization. Polyketide
synthase genes fall into gene clusters, and at least one type (designated type I) of
polyketide synthases have large size genes and enzymes, complicating genetic
manipulation and in vitro studies of these genes/proteins.



The ability to select and combine desired components from a library of
polyketides and post-polyketide biosynthesis genes for generation of novel
polyketides for study is appealing. The method(s) of the present invention make
possible, and facilitate, the cloning of novel polyketide synthases, since one can
generate gene banks with clones containing large inserts (especially when using the
f-factor based vectors), which facilitates cloning of gene clusters.

Preferably, the gene cluster DNA is ligated into a vector, particularly
wherein a vector further comprises expression regulatory sequences which can
control and regulate the production of a detectable protein or protein-related array
activity from the ligated gene clusters. Use of vectors which have an exceptionally
large capacity for exogenous DNA introduction is particularly appropriate for use
with such gene clusters, and such vectors are described by way of example herein to
include the f-factor (or "fertility factor") of E. coli. This f-factor of E. coli is a plasmid which
effects high-frequency transfer of itself during conjugation and is ideal to achieve
and stably propagate large DNA fragments, such as gene clusters from mixed
microbial samples.



LIBRARY SCREENING:

After normalized libraries have been generated, unique enzymatic activities
can be discovered using a variety of solid- or liquid-phase screening assays in a
variety of formats, including a high-throughput robotic format, described herein.
The normalization of the DNA used to construct the libraries is a key component in
the process. Normalization will increase the representation of DNA from important
organisms, including those represented in minor amounts in the sample.
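
The screening benefit of increased representation can be made concrete with a simple independent-sampling model: if a fraction f of clones derives from the organism of interest, the probability of seeing at least one such clone among N clones is 1 - (1 - f)^N. The sketch below solves this for N. The model, the function name, and the example fractions are illustrative assumptions and are not part of the described method.

```python
import math


def clones_needed(fraction: float, probability: float = 0.99) -> int:
    """Smallest N such that 1 - (1 - fraction)**N >= probability,
    assuming every clone is an independent draw from the library."""
    if not 0.0 < fraction < 1.0:
        raise ValueError("fraction must be between 0 and 1 (exclusive)")
    if not 0.0 < probability < 1.0:
        raise ValueError("probability must be between 0 and 1 (exclusive)")
    return math.ceil(math.log(1.0 - probability) / math.log(1.0 - fraction))


# Hypothetical representation of a rare organism before and after normalization.
before = clones_needed(0.001)   # 0.1% of clones
after = clones_needed(0.25)     # 25% of clones
print(f"clones to screen before normalization: {before}")
print(f"clones to screen after normalization:  {after}")
```

Under this model the number of clones that must be screened falls roughly in proportion to the increase in representation, which is the sense in which normalization improves screening efficiency.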
Example 1
DNA Isolation

1.  Samples are resuspended directly in the following buffer:
        500 mM Tris-HCl, pH 8.0
        100 mM NaCl
        1 mM sodium citrate
        100 µg/ml polyadenosine
        5 mg/ml lysozyme

2.  Incubate at 37°C for 1 hour with occasional agitation.

3.  Digest with 2 mg/ml Proteinase K enzyme (Boehringer Mannheim) at
    37°C for 30 min.

4.  Add 8 ml of lysis buffer [200 mM Tris-HCl, pH 8.0/100 mM
    NaCl/4% (wt/vol) SDS/10% (wt/vol) 4-aminosalicylate] and mix
    gently by inversion.

5.  Perform three cycles of freezing in a dry ice-ethanol bath and
    thawing in a 65°C water bath to release nucleic acids.

6.  Extract the mixture with phenol and then phenol/chloroform/isoamyl
    alcohol.

7.  Add 4 grams of acid-washed polyvinylpolypyrrolidone (PVPP) to the
    aqueous phase and incubate 30 minutes at 37°C to remove organic
    contamination.

8.  Pellet PVPP and filter the supernatant through a 0.45 µm membrane
    to remove residual PVPP.

9.  Precipitate nucleic acids with isopropyl alcohol.

10. Resuspend pellet in 500 µl TE (10 mM Tris-HCl, pH 8.0/1.0 mM
    EDTA).

11. Add 0.1 g of ammonium acetate and centrifuge mixture at 4°C for
    30 minutes.

12. Precipitate nucleic acids with isopropanol.
Example 2
Bis-Benzimide Separation of DNA

A sample composed of genomic DNA from Clostridium perfringens (27%
G+C), Escherichia coli (49% G+C), and Micrococcus lysodictium (72% G+C) was
purified on a cesium-chloride gradient. The cesium chloride (Rf = 1.3980) solution
was filtered through a 0.2 µm filter and 15 ml were loaded into a 35 ml OptiSeal
tube (Beckman). The DNA was added and thoroughly mixed. Ten micrograms of
bis-benzimide (Sigma; Hoechst 33258) were added and mixed thoroughly. The
tube was then filled with the filtered cesium chloride solution and spun in a VTi50
rotor in a Beckman L8-70 Ultracentrifuge at 33,000 rpm for 72 hours. Following
centrifugation, a syringe pump and fractionator (Brandel Model 186) were used to
drive the gradient through an ISCO UA-5 UV absorbance detector set to 280 nm.
Three peaks representing the DNA from the three organisms were obtained. PCR
amplification of DNA encoding rRNA from a 10-fold dilution of the E. coli peak
was performed with the following primers to amplify eubacterial sequences:

Forward primer: (27F)
5'-AGAGTTTGATCCTGGCTCAG-3'

Reverse primer: (1492R)
5'-GGTTACCTTGTTACGACTT-3'

Example 3

Sample of DNA obtained from the gill tissue of a clam
harboring an endosymbiont which cannot be
physically separated from its host

1. Purify DNA on cesium chloride gradient according to published protocols
   (Sambrook, 1989).

2. Prepare second cesium chloride solution (Rf = 1.3980), filter through
   0.2 µm filter and load 15 ml into a 35 ml OptiSeal tube (Beckman).

3. Add 10 µg bis-benzimide (Sigma; Hoechst 33258) and mix.

4. Add 50 ng purified DNA and mix thoroughly.

5. Spin in a VTi50 rotor in a Beckman L8-70 Ultracentrifuge at 33,000
   rpm for 72 hours.

6. Use syringe pump and fractionator (Brandel Model 186) to drive
   gradient through an ISCO UA-5 UV absorbance detector set to
   280 nm.

Example 4
Complexity Analysis

1. 16S rRNA analysis is used to analyze the complexity of the DNA recovered
   from environmental samples (Reysenbach, 1992; DeLong, 1992; Barns,
   1994) according to the protocol outlined in Example 1.

2. Eubacterial sequences are amplified using the following primers:
   Forward:
   5'-AGAGTTTGATCCTGGCTCAG-3'
   Reverse:
   5'-GGTTACCTTGTTACGACTT-3'

   Archaeal sequences are amplified using the following primers:
   Forward:
   5'-GCGGATCCGCGGCCGCTGCACAYCTGGTYGATYCTGCC-3'
   Reverse:
   5'-GACGGGCGGTGTGTRCA-3' (R = purine; Y = pyrimidine)

3. Amplification reactions proceed as published. The reaction buffer
   used in the amplification of the archaeal sequences includes 5%
   acetamide (Barns, 1994).

4. The products of the amplification reactions are rendered blunt ended
   by incubation with Pfu DNA polymerase.

5. The blunt-ended products are ligated into the pCR-Script plasmid in the
   presence of SrfI restriction endonuclease according to the manufacturer's
   protocol (Stratagene Cloning Systems).

6. Samples are sequenced using standard sequencing protocols
   (reference) and the number of different sequences present in the
   sample is determined.
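
The archaeal primers in step 2 contain the degenerate symbols R (purine: A or G) and Y (pyrimidine: C or T), so each primer stands for a small pool of concrete oligonucleotides. The following minimal sketch enumerates that pool. The function name is hypothetical, and the lookup table covers only the symbols that appear in this Example.

```python
from itertools import product

# Only the symbols used by the primers in this Example are included.
DEGENERATE_BASES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",  # purine
    "Y": "CT",  # pyrimidine
}


def expand_degenerate(primer: str) -> list[str]:
    """Return every concrete sequence represented by a degenerate primer."""
    choices = [DEGENERATE_BASES[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


archaeal_forward = "GCGGATCCGCGGCCGCTGCACAYCTGGTYGATYCTGCC"
archaeal_reverse = "GACGGGCGGTGTGTRCA"

print(len(expand_degenerate(archaeal_forward)), "forward variants")
print(len(expand_degenerate(archaeal_reverse)), "reverse variants")
```

Each degenerate position doubles the number of variants, so the forward primer (three Y positions) represents eight sequences and the reverse primer (one R position) represents two.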

Example 5
Normalization

Purified DNA is fractionated according to the bis-benzimide protocol of
Example 2, and recovered DNA is sheared or enzymatically digested to 3-6 kb
fragments. Lone-linker primers are ligated and the DNA is size selected.
Size-selected DNA is amplified by PCR, if necessary.

Normalization is then accomplished as follows:

1. Double-stranded DNA sample is resuspended in hybridization buffer
   (0.[illegible] M NaH2PO4, pH 6.8/0.32 M NaCl/1 mM EDTA/0.1% SDS).

2. Sample is overlaid with mineral oil and denatured by boiling for 10
   minutes.

3. Sample is incubated at 68°C for 12-36 hours.

4. Double-stranded DNA is separated from single-stranded DNA
   according to standard protocols (Sambrook, 1989) on hydroxyapatite
   at 60°C.

5. The single-stranded DNA fraction is desalted and amplified by PCR.

6. The process is repeated for several more rounds (up to 5 or more).

Example 6
Library Construction

1. Genomic DNA dissolved in TE buffer is vigorously passed through a 25
   gauge double-hubbed needle until the sheared fragments are in the desired
   size range.

2. DNA ends are "polished" or blunted with Mung Bean nuclease.

3. EcoRI restriction sites in the target DNA are protected with EcoRI
   methylase.

4. EcoRI linkers [GGAATTCC] are ligated to the blunted/protected
   DNA using a very high molar ratio of linkers to target DNA.

5. Linkers are cut back with EcoRI restriction endonuclease and the
   DNA is size fractionated using sucrose gradients.

6. Target DNA is ligated to the λZAPII vector, packaged using in vitro
   lambda packaging extracts, and grown in the appropriate E. coli XL1
   Blue host cell.
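
Step 3 matters because any internal EcoRI site (GAATTC) left unprotected would be cleaved in step 5 together with the linkers. As an illustration only, the sketch below locates such sites in a target sequence. The function name and the toy sequence are hypothetical.

```python
ECORI_SITE = "GAATTC"  # EcoRI recognition sequence


def find_sites(seq: str, site: str = ECORI_SITE) -> list[int]:
    """Return the 0-based start positions of every occurrence of `site`."""
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        # Resume one base later so overlapping matches are not skipped.
        start = seq.find(site, start + 1)
    return positions


# Toy insert with two internal EcoRI sites (illustrative only).
insert = "AAGAATTCGGGAATTCTT"
print("internal EcoRI sites at:", find_sites(insert))

# The linker from step 4 carries the same recognition sequence.
print("site within linker at:", find_sites("GGAATTCC"))
```

An insert for which this list is non-empty would be fragmented during linker cut-back unless its sites had first been methylated.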



Example 7
Library Screening

The following is a representative example of a procedure for screening an
expression library prepared in accordance with Example 6.

The general procedures for testing for various chemical characteristics are
generally applicable to substrates other than those specifically referred to in this
Example.

Screening for Activity. Plates of the library prepared as described in Example 6 
are used to multiply inoculate a single plate containing 200 \xL of LB Amp/Meth : 
glycerol in each well. This step is performed using the High Density Replicating 
20 Tool (HDRT) of the Beckman B.iomek with a 1% bleach, water, isopropanol, ^air- 
dry sterilization cycle between each inoculation. The single plate is grown for 2h 
at 37°C and is then used to inoculate two white 96-well Dynatech microliter 
daughter plates containing 250 |iL of LB Amp/Meth, glycerol in each well. The 
original single plate is'incubated at 37°C for 18h, then stored at -80°C. The two 
25 condensed daughter plates are incubated at 37°C also for 18 h. The condensed 
daughter plates are then heated at 70°C for 45 min. to kill the cells and inactivate 
the host E.coli enzymes. A stock solution of 5mg/mL morphourea phenylaianyl-7- 
amino-4-trifluoromethyl coumarin (MuPheAFC, the 'substrate') in DMSO is diluted 
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to 600 fiM with 50 mM pH 7.5 Hepes buffer containing 0.6 mg/mL of the 



detergent dodecyl maltoside. 



VN. k 9 




Mu-Phe,AFC 



Fifty µL of the 600 µM MuPheAFC solution is added to each of the wells of the
white condensed plates with one 100 µL mix cycle using the Biomek to yield a
final concentration of substrate of ~100 µM. The fluorescence values are recorded
(excitation = 400 nm, emission = 505 nm) on a plate reading fluorometer
immediately after addition of the substrate (t=0). The plate is incubated at 70°C
for 100 min, then allowed to cool to ambient temperature for 15 additional
minutes. The fluorescence values are recorded again (t=100). The values at t=0
are subtracted from the values at t=100 to determine if an active clone is present.
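
The arithmetic behind this step can be sketched as follows. This is a minimal illustration, not part of the described procedure: the function names, the example well readings, and the hit threshold are all hypothetical, since the text states only that the t=0 values are subtracted from the t=100 values.

```python
def final_concentration_uM(stock_uM, added_uL, well_uL):
    """Concentration after adding `added_uL` of stock to `well_uL` already in the well."""
    return stock_uM * added_uL / (added_uL + well_uL)


def net_fluorescence(t0, t100):
    """Subtract the t=0 reading from the t=100 reading, well by well."""
    return {well: t100[well] - t0[well] for well in t0}


def candidate_wells(t0, t100, threshold):
    """Wells whose net signal exceeds a user-chosen threshold (an assumption;
    the text does not give a numerical cutoff)."""
    net = net_fluorescence(t0, t100)
    return sorted(well for well, value in net.items() if value > threshold)


if __name__ == "__main__":
    # 50 uL of 600 uM substrate into a 250 uL culture well
    print(final_concentration_uM(600, 50, 250))

    # Illustrative readings only (arbitrary fluorescence units)
    t0 = {"A1": 120, "A2": 115, "A3": 130}
    t100 = {"A1": 125, "A2": 940, "A3": 128}
    print(candidate_wells(t0, t100, threshold=100))
```

With 50 µL added to 250 µL, the dilution factor is 6, which is where the ~100 µM final concentration comes from.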

The data will indicate whether one of the clones in a particular well is
hydrolyzing the substrate. In order to determine the individual clone which carries
the activity, the source library plates are thawed and the individual clones are used
to singly inoculate a new plate containing LB Amp/Meth, glycerol. As above, the
plate is incubated at 37°C to grow the cells, heated at 70°C to inactivate the host
enzymes, and 50 µL of 600 µM MuPheAFC is added using the Biomek.
Additionally, three other substrates are tested. They are methyl umbelliferone
heptanoate, the CBZ-arginine rhodamine derivative, and fluorescein-conjugated
casein (~3.2 mol fluorescein per mol of casein).
[Chemical structure: methyl umbelliferone heptanoate]



The umbelliferone and rhodamine are added as 600 µM stock solutions in 50 µL of
Hepes buffer. The fluorescein conjugated casein is also added in 50 µL at a stock
concentration of 20 and 200 mg/mL. After addition of the substrates the t=0
fluorescence values are recorded, the plate is incubated at 70°C, and the t=100 min
values are recorded as above.

These data indicate which plate the active clone is in, where the arginine
rhodamine derivative is also turned over by this activity, but the lipase substrate,
methyl umbelliferone heptanoate, and the protein, fluorescein-conjugated casein, do
not function as substrates.



Chiral amino esters may be determined using at least the following
substrates:



[Chemical structures of the chiral amino ester substrates; legible R groups include CH2-OH and CH2-CO2-]



For each substrate which is turned over the enantioselectivity value, E, is
determined according to the equation below:

$$E = \frac{\ln[1 - c(1 + ee_p)]}{\ln[1 - c(1 - ee_p)]}$$

where ee_p = the enantiomeric excess (ee) of the hydrolyzed product and c = the
percent conversion of the reaction. See Wong and Whitesides, Enzymes in
Synthetic Organic Chemistry, 1994, Elsevier, Tarrytown, New York, pp. 9-12.
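
As a worked illustration of the E equation, the sketch below evaluates it for one set of input values. It assumes that c and ee_p are entered as fractions between 0 and 1 (for example, 50% conversion as 0.5), which is what the logarithms require; the function name and the example inputs are hypothetical.

```python
import math


def enantioselectivity(c, ee_p):
    """E = ln[1 - c(1 + ee_p)] / ln[1 - c(1 - ee_p)].

    c    : conversion as a fraction (assumption: 0 < c < 1)
    ee_p : enantiomeric excess of the product as a fraction (0 <= ee_p < 1)
    """
    if not 0 < c < 1:
        raise ValueError("c must be a fraction between 0 and 1")
    if not 0 <= ee_p < 1:
        raise ValueError("ee_p must be a fraction in [0, 1)")
    if c * (1 + ee_p) >= 1:
        raise ValueError("c(1 + ee_p) must be below 1 for the logarithm to exist")
    numerator = math.log(1 - c * (1 + ee_p))
    denominator = math.log(1 - c * (1 - ee_p))
    return numerator / denominator


if __name__ == "__main__":
    # Illustrative inputs only: 50% conversion, 90% ee of product
    print(round(enantioselectivity(0.5, 0.9), 1))
```

A racemic product (ee_p = 0) gives E = 1, that is, no selectivity, which is a quick sanity check on any implementation.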

The enantiomeric excess is determined by either chiral high performance
liquid chromatography (HPLC) or chiral capillary electrophoresis (CE). Assays are
performed as follows: two hundred µL of the appropriate buffer is added to each
well of a 96-well white microtiter plate, followed by 50 µL of partially or
completely purified enzyme solution; 50 µL of substrate is added and the increase
in fluorescence monitored versus time until 50% of the substrate is consumed or
the reaction stops, whichever comes first.

Example 8 

Construction of a Stable, Large Insert Picoplankton Genomic DNA Library

Cell collection and preparation of DNA. Agarose plugs containing
concentrated picoplankton cells were prepared from samples collected on an
oceanographic cruise from Newport, Oregon to Honolulu, Hawaii. Seawater (30
liters) was collected in Niskin bottles, screened through 10 µm Nytex, and
concentrated by hollow fiber filtration (Amicon DC10) through 30,000 MW cutoff
polysulfone filters. The concentrated bacterioplankton cells were collected on a
0.22 µm, 47 mm Durapore filter, and resuspended in 1 ml of 2X STE buffer (1M
NaCl, 0.1M EDTA, 10 mM Tris, pH 8.0) to a final density of approximately 1 x
10^10 cells per ml. The cell suspension was mixed with one volume of 1% molten
Seaplaque LMP agarose (FMC) cooled to 40°C, and then immediately drawn into a
1 ml syringe. The syringe was sealed with parafilm and placed on ice for 10 min.
The cell-containing agarose plug was extruded into 10 ml of Lysis Buffer (10 mM
Tris pH 8.0, 50 mM NaCl, 0.1M EDTA, 1% Sarkosyl, 0.2% sodium deoxycholate,
1 mg/ml lysozyme) and incubated at 37°C for one hour. The agarose plug was
then transferred to 40 ml of ESP Buffer (1% Sarkosyl, 1 mg/ml proteinase K, in
0.5M EDTA), and incubated at 55°C for 16 hours. The solution was decanted and
replaced with fresh ESP Buffer, and incubated at 55°C for an additional hour. The
agarose plugs were then placed in 50 mM EDTA and stored at 4°C shipboard for
the duration of the oceanographic cruise.

One slice of an agarose plug (72 µl) prepared from a sample collected off
the Oregon coast was dialyzed overnight at 4°C against 1 mL of buffer A (100 mM
NaCl, 10 mM Bis Tris Propane-HCl, 100 µg/ml acetylated BSA: pH 7.0 @ 25°C)
in a 2 mL microcentrifuge tube. The solution was replaced with 250 µl of fresh
buffer A containing 10 mM MgCl2 and 1 mM DTT and incubated on a rocking
platform for 1 hr at room temperature. The solution was then changed to 250 µl of
the same buffer containing 4U of Sau3AI (NEB), equilibrated to 37°C in a water
bath, and then incubated on a rocking platform in a 37°C incubator for 45 min.
The plug was transferred to a 1.5 ml microcentrifuge tube and incubated at 68°C
for 30 min to inactivate the enzyme and to melt the agarose. The agarose was
digested and the DNA dephosphorylated using Gelase and HK-phosphatase
(Epicentre), respectively, according to the manufacturer's recommendations.
Protein was removed by gentle phenol/chloroform extraction and the DNA was
ethanol precipitated, pelleted, and then washed with 70% ethanol. This partially
digested DNA was resuspended in sterile H2O to a concentration of 2.5 ng/µl for
ligation to the pFOS1 vector.

PCR amplification results from several of the agarose plugs (data not
shown) indicated the presence of significant amounts of archaeal DNA.
Quantitative hybridization experiments using rRNA extracted from one sample,
collected at 200 m of depth off the Oregon Coast, indicated that planktonic archaea
in this assemblage comprised approximately 4.7% of the total picoplankton
biomass (this sample corresponds to "PAC1"-200 m in Table 1 of DeLong et al.,
High abundance of Archaea in Antarctic marine picoplankton, Nature, 371:695-698,
1994). Results from archaeal-biased rDNA PCR amplification performed on
agarose plug lysates confirmed the presence of relatively large amounts of archaeal
DNA in this sample. Agarose plugs prepared from this picoplankton sample were
chosen for subsequent fosmid library preparation. Each 1 ml agarose plug from
this site contained approximately 7.5 x 10^5 cells, therefore approximately 5.4 x 10^4
cells were present in the 72 µl slice used in the preparation of the partially digested
DNA.

Vector arms were prepared from pFOS1 as described (Kim et al., Stable
propagation of cosmid sized human DNA inserts in an F factor based vector, Nucl.
Acids Res., 20:1083-1085, 1992). Briefly, the plasmid was completely digested
with AatII, dephosphorylated with HK phosphatase, and then digested with BamHI
to generate two arms, each of which contained a cos site in the proper orientation
for cloning and packaging ligated DNA between 35-45 kbp. The partially digested
picoplankton DNA was ligated overnight to the pFOS1 arms in a 15 µl ligation
reaction containing 25 ng each of vector and insert and 1U of T4 DNA ligase
(Boehringer-Mannheim). The ligated DNA in four microliters of this reaction was
in vitro packaged using the Gigapack XL packaging system (Stratagene), the
fosmid particles transfected to E. coli strain DH10B (BRL), and the cells spread
onto LB cm15 plates. The resultant fosmid clones were picked into 96-well microtiter
dishes containing LB cm15 supplemented with 7% glycerol. Recombinant fosmids,
each containing ca. 40 kb of picoplankton DNA insert, yielded a library of 3,552
fosmid clones, containing approximately 1.4 x 10^8 base pairs of cloned DNA. All
of the clones examined contained inserts ranging from 38 to 42 kbp. This library
was stored frozen at -80°C for later analysis.
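
The library size quoted above follows from the clone count and the insert size. The short sketch below reproduces that arithmetic; the function and variable names are illustrative only, and the 38 and 42 kbp bounds are simply the insert range reported for the clones examined.

```python
def cloned_base_pairs(n_clones: int, insert_bp: int) -> int:
    """Total cloned DNA, assuming every clone carries an insert of `insert_bp`."""
    return n_clones * insert_bp


N_CLONES = 3552  # fosmid clones in the library

typical = cloned_base_pairs(N_CLONES, 40_000)  # ca. 40 kb inserts
low = cloned_base_pairs(N_CLONES, 38_000)      # smallest insert observed
high = cloned_base_pairs(N_CLONES, 42_000)     # largest insert observed

if __name__ == "__main__":
    print(f"typical: {typical:.1e} bp")
    print(f"range:   {low:.2e} to {high:.2e} bp")
```

At 40 kb per clone the total is 142,080,000 bp, which rounds to the approximately 1.4 x 10^8 base pairs stated in the text.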

Numerous modifications and variations of the present invention are possible
in light of the above teachings; therefore, within the scope of the claims, the
invention may be practiced other than as particularly described.
AA3

A2
Fluorescein conjugated casein (3.2 mol fluorescein/mol casein)
CBZ-Ala-AMC
t-BOC-Ala-Ala-Asp-AMC
succinyl-Ala-Gly-Leu-AMC
CBZ-Arg-AMC
CBZ-Met-AMC
morphourea-Phe-AMC
(t-BOC = t-butoxy carbonyl, CBZ = carbonyl benzyloxy,
AMC = 7-amino-4-methyl coumarin)

AB3
[Chemical structure]

AD3
Fluorescein conjugated casein
t-BOC-Ala-Ala-Asp-AFC
CBZ-Ala-Ala-Lys-AFC
succinyl-Ala-Ala-Phe-AFC
succinyl-Ala-Gly-Leu-AFC
(AFC = 7-amino-4-trifluoromethyl coumarin)

AC3
[Chemical structure]

AE3
Fluorescein conjugated casein

AF3
t-BOC-Ala-Ala-Asp-AFC
CBZ-Asp-AFC

AG3
CBZ-Ala-Ala-Lys-AFC
CBZ-Arg-AFC

AH3
succinyl-Ala-Ala-Phe-AFC
CBZ-Phe-AFC
CBZ-Trp-AFC

AI3
succinyl-Ala-Gly-Leu-AFC
CBZ-Ala-AFC
CBZ-Ser-AFC
Table 3

[Chemical structures of substrate groups LI3, LJ3, LK3, LL3, LM3, LN3 and LO3, "and all of L2"]



WO 99/45154 



PCT/US99/04917 



Table 4

4-methyl umbelliferone
wherein R =

β-D-galactose
β-D-glucose
β-D-glucuronide
β-D-cellotrioside
β-D-cellobiopyranoside
β-D-galactose
α-D-galactose
β-D-glucose
α-D-glucose
β-D-glucuronide
β-D-N,N-diacetylchitobiose
β-D-fucose
α-L-fucose
β-L-fucose
β-D-mannose
α-D-mannose

non-Umbelliferyl substrates

amylose [polyglucan α1,4 linkages], amylopectin
[polyglucan branching α1,6 linkages]
xylan [poly 1,4-D-xylan]
amylopectin, pullulan
sucrose, fructofuranoside

G2
GB3
GC3
GD3
GE3
GI3
GJ3
GK3

GA3
GF3
GG3
GH3
What Is Claimed Is:

1. A process for forming a normalized genomic DNA library from an
environmental sample, which comprises the steps of:

(a) isolating a genomic DNA population from the environmental sample;

(b) at least one of the steps selected from the group consisting of (i)
amplifying the copy number of the DNA population so isolated, and (ii) recovering
a fraction of the isolated genomic DNA having a desired characteristic; and

(c) normalizing the representation of various DNAs within the genomic
DNA population so as to form a normalized library of genomic DNA from the
environmental sample.

2. The process of claim 1 which comprises the step of recovering a fraction
of the isolated genomic DNA having a desired characteristic.

3. The process of claim 1 which comprises the step of
amplifying the copy number of the DNA population so isolated.

4. The process of claim 1 wherein the step of amplifying the genomic
DNA precedes the normalizing step.

5. The process of claim 1 wherein the step of normalizing the genomic
DNA precedes the amplifying step.

6. The process of claim 1 which comprises both the steps of (i) amplifying
the copy number of the DNA population so isolated and (ii) recovering a fraction
of the isolated genomic DNA having a desired characteristic.

7. A normalized genomic DNA library formed from an environmental
sample by a process comprising the steps of:

(a) isolating a genomic DNA population from the environmental sample;

(b) at least one of (i) amplifying the copy number of the DNA population
so isolated and (ii) recovering a fraction of the isolated genomic DNA having a
desired characteristic; and

(c) normalizing the representation of various DNAs within the genomic
DNA population so as to form a normalized library of genomic DNA from the
environmental sample.

8. The library of claim 1 wherein the process of forming said library
comprises the step of recovering a fraction of the isolated genomic DNA having a
desired characteristic.

9. The library of claim 1 wherein the process of forming said library
comprises the step of amplifying the copy number of the DNA population so
isolated.

10. The library of claim 1 wherein in the process of forming said library
the step of amplifying the genomic DNA precedes the normalizing step.

11. The library of claim 1 wherein in the process of forming said library
the step of normalizing the genomic DNA precedes the amplifying step.

12. The library of claim 1, wherein the process of forming said library
comprises both the steps of (i) amplifying the copy number of the DNA population
so isolated and (ii) recovering a fraction of the isolated genomic DNA having a
desired characteristic.
13. A process for forming a normalized library of genomic gene clusters
from an environmental sample which comprises

(a) isolating a genomic DNA population from the environmental sample;

(b) at least one of (i) amplifying the copy number of the DNA population
so isolated and (ii) recovering a fraction of the isolated genomic DNA having a
desired characteristic; and

(c) normalizing the representation of various DNAs within the genomic
DNA population so as to form a normalized library of genomic DNA from the
environmental sample.

14. A normalized library of genomic gene clusters formed from an
environmental sample by a process comprising the steps of

(a) isolating a genomic DNA population from the environmental sample;

(b) at least one of (i) amplifying the copy number of the DNA population
so isolated and (ii) recovering a fraction of the isolated genomic DNA having a
desired characteristic; and

(c) normalizing the representation of various DNAs within the genomic
DNA population so as to form a normalized library of genomic DNA from the
environmental sample.
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANTS: DIVERSA CORPORATION

(ii) TITLE OF INVENTION:
PRODUCTION AND USE OF NORMALIZED DNA LIBRARIES

(iii) NUMBER OF SEQUENCES: 10

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Fish & Richardson, P.C.
(B) STREET: 4225 Executive Square, Suite 1400
(C) CITY: San Diego
(D) STATE: California
(E) COUNTRY: USA
(F) ZIP: 92037

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: 3.5 INCH DISKETTE
(B) COMPUTER: IBM PS/2
(C) OPERATING SYSTEM: MS-DOS
(D) SOFTWARE: WORD PERFECT 5.1

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: 09/034,724
(B) FILING DATE: 04 March 1998
(C) CLASSIFICATION: Unassigned

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: HAILE, LISA A.
(B) REGISTRATION NUMBER: 38,347
(C) REFERENCE/DOCKET NUMBER: 09010/033001

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 619-678-5070
(B) TELEFAX: 619-678-5099

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 52 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGATTGAA GACCCTATGG AC 52

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 31 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
CGGAAGATCT TTAAGCACTT CTCTCAGGTT C 31

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 52 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGGACAGG CTTGAAAAAG TA 52

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 31 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
CGGAAGATCT TCAGCTAAGC TTCTCTAAGA A 31

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 52 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGTGGGAA TTAGACCCTA AA 52



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 31 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
CGGAGGATCC CTACACCTGT TTTTCAAGCT C 31

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 52 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGACATAC TTAATGAACA AT 52

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 31 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CGGAAGATCT TTATGAGAAG TCCCTTTCAA G 31

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 52 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCGGAAA CTGGCCGAGC GG 52

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 31 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
CGGAGGATCC TTAAAGTGCC GCTTCGATCA A 31
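
A listing of this kind can be checked mechanically. The sketch below is an illustration only: it transcribes six of the sequences above (SEQ ID NOS: 1, 2, 5, 6, 9 and 10), confirms that each matches its declared length, and looks for a few common restriction enzyme recognition sites. The site annotations are general knowledge about those enzymes and are an assumption about how the sequences might be read; the listing itself does not name any enzyme.

```python
# Sequences transcribed from the listing; spaces are removed before use.
PRIMERS = {
    1: ("CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGATTGAA GACCCTATGG AC", 52),
    2: ("CGGAAGATCT TTAAGCACTT CTCTCAGGTT C", 31),
    5: ("CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGTGGGAA TTAGACCCTA AA", 52),
    6: ("CGGAGGATCC CTACACCTGT TTTTCAAGCT C", 31),
    9: ("CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCGGAAA CTGGCCGAGC GG", 52),
    10: ("CGGAGGATCC TTAAAGTGCC GCTTCGATCA A", 31),
}

# Standard recognition sequences (not taken from the listing).
SITES = {
    "EcoRI": "GAATTC",
    "BglII": "AGATCT",
    "MfeI": "CAATTG",
    "BamHI": "GGATCC",
}


def clean(seq: str) -> str:
    """Strip the block spacing used in the listing."""
    return seq.replace(" ", "").upper()


def length_ok(seq: str, declared: int) -> bool:
    """True when the sequence has exactly the declared number of nucleotides."""
    return len(clean(seq)) == declared


def sites_in(seq: str) -> dict:
    """Map each recognition site found to the 0-based index of its first occurrence."""
    s = clean(seq)
    return {name: s.find(motif) for name, motif in SITES.items() if motif in s}


if __name__ == "__main__":
    for seq_id, (seq, declared) in PRIMERS.items():
        print(seq_id, length_ok(seq, declared), sites_in(seq))
```

Running the length check before any further analysis is a cheap way to catch transcription errors in a sequence listing.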